Filtering Tandem Repeats in DNA Sequences
نویسندگان
چکیده
A tandem repeat is a sequence of two or more contiguous, approximate copies of a pattern. Tandem repeats occur in the genomes of both eukaryotic and prokaryotic organisms. They are important in numerous fields including disease diagnosis, mapping studies, human identity testing (DNA fingerprinting), sequence homology, and population studies. Although tandem repeats have been used by biologists for many years, there are few tools available for performing an exhaustive search for all tandem repeats in a given sequence. In this paper we describe a software tool that has been implemented as a postprocessing stage for a popular tandem repeats program. This new stage allows the program to scale up for use with whole genomic sequences. The program now organizes and filters the data into a meaningful and manageable set. The output is presented as a succinct table of repeats, including several relevant statistics for each repeat.
منابع مشابه
TRDB—The Tandem Repeats Database
Tandem repeats in DNA have been under intensive study for many years, first, as a consequence of their usefulness as genomic markers and DNA fingerprints and more recently as their role in human disease and regulatory processes has become apparent. The Tandem Repeats Database (TRDB) is a public repository of information on tandem repeats in genomic DNA. It contains a variety of tools for repeat...
متن کاملLarge-scale analysis of tandem repeat variability in the human genome
Tandem repeats are short DNA sequences that are repeated head-to-tail with a propensity to be variable. They constitute a significant proportion of the human genome, also occurring within coding and regulatory regions. Variation in these repeats can alter the function and/or expression of genes allowing organisms to swiftly adapt to novel environments. Importantly, some repeat expansions have a...
متن کاملHuman tandem repeat sequences in forensic DNA typing.
It has been 20 years since the first development of DNA fingerprinting and the start of forensic DNA typing. Ever since, human tandem repeat DNA sequences have been the main targets for forensic DNA analysis. These repeat sequences are classified into minisatellites (or VNTRs) and microsatellites (or STRs). In this brief review, we discuss the historical and current forensic applications of suc...
متن کاملA Visual Tool for Dna Repeats Localization
The detection of tandem repeats is important in biology and medicine as it can be used for phylogenic studies and disease diagnosis. A major difficulty in identification of repeats arises from the fact that the repeat units can be either exact or imperfect, in tandem or dispersed, and of unspecified length. This paper presents results obtained by combining grey level spectrograms with a novel n...
متن کاملDistributions of dimeric tandem repeats in non-coding and coding DNA sequences.
We study the length distribution functions for the 16 possible distinct dimeric tandem repeats in DNA sequences of diverse taxonomic partitions of GenBank (known human and mouse genomes, and complete genomes of Caenorhabditis elegans and yeast). For coding DNA, we find that all 16 distribution functions are exponential. For non-coding DNA, the distribution functions for most of the dimeric repe...
متن کامل